Hongyuan Lu

From Abstract to Contextual: What LLMs Still Cannot Do in Mathematics

Jan 30, 2026

VERHallu: Evaluating and Mitigating Event Relation Hallucination in Video Large Language Models

Jan 15, 2026

Stephanie2: Thinking, Waiting, and Making Decisions Like Humans in Step-by-Step AI Social Chat

Jan 09, 2026

LNE-Blocking: An Efficient Framework for Contamination Mitigation Evaluation on Large Language Models

Sep 18, 2025

Dictionary Insertion Prompting for Multilingual Reasoning on Multilingual Large Language Models

Nov 02, 2024

Clean Evaluations on Contaminated Visual Language Models

Oct 09, 2024

Toxic Subword Pruning for Dialogue Response Generation on Large Language Models

Oct 05, 2024

Stephanie: Step-by-Step Dialogues for Mimicking Human Interactions in Social Conversations

Jul 04, 2024

Unveiling the Generalization Power of Fine-Tuned Large Language Models

Mar 14, 2024

Consecutive Model Editing with Batch alongside HooK Layers

Mar 08, 2024